Bingyin Mei

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

May 21, 2026
Via arXiv

FashionMV: Product-Level Composed Image Retrieval with Multi-View Fashion Data

Apr 11, 2026
Via arXiv